Joint position-pitch extraction from multichannel audio

نویسندگان

  • Michael Wohlmayr
  • Marián Képesi
چکیده

Recently, a method for joint extraction of pitch and location information from two-channel recordings has been introduced. This framework offers a new, natural representation of all acoustic sources in the auditory scene, and has potential to be used as front-end in applications such as advanced tracking of multiple speakers in conference rooms. In this paper, we explore basic properties of this method and propose improvements in performance by using circular arrangements of multiple microphones.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fast and Robust Realtime Speaker Tracking Using Multichannel Audio and a Particle Filter

In this work a method to track the azimuth (horizontal angle) from multiple speakers in a typically reverberant real office environment is presented. The steered-response-power algorithm (SRP-PHAT) or the recently published joint position and pitch extraction approach (PoPi) combined with a sequential Monte Carlo estimation leads to a robust and fast tracker for audio indexing. One intention of...

متن کامل

Exploring pitch and timbre through 3d spaces: embodied models in virtual reality as a basis for performance systems design

Our paper builds on an ongoing collaboration between theorists and practitioners within the computer music community, with a specific focus on three-dimensional environments as an incubator for performance systems design. In particular, we are concerned with how to provide accessible means of controlling spatialization and timbral shaping in an integrated manner through the collection of perfor...

متن کامل

Audio Melody Extraction for Mirex 2009

This paper describes our submission to the audio melody extraction evaluation addressing the task of identifying the melody pitch contour from polyphonic musical audio. It shall give an overview about the algorithm and a discussion of the evaluation results. The presented algorithm is a derivative of our submission to MIREX’06. Major changes between the two versions are highlighted and the impa...

متن کامل

Sound Event Detection in Multichannel Audio Using Spatial and Harmonic Features

In this paper, we propose the use of spatial and harmonic features in combination with long short term memory (LSTM) recurrent neural network (RNN) for automatic sound event detection (SED) task. Real life sound recordings typically have many overlapping sound events, making it hard to recognize with just mono channel audio. Human listeners have been successfully recognizing the mixture of over...

متن کامل

Multiple Fundamental Frequency Extraction for Mirex

This extended abstract outlines an efficient approach for the extraction of multiple fundamental frequencies (F0) from polyphonic musical audio. The algorithm consists of three analysis steps. At first a multi-resolution spectral analysis is performed on the audio signal. Then, the most salient pitches are identified using a pitch extraction algorithm, which is designed to identify the predomin...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007